Verifying LVCSR Output at Different Levels with Generalized Posterior Probability

نویسندگان

  • Frank K. SOONG
  • Wai-Kit LO
  • Satoshi NAKAMURA
چکیده

Generalized posterior probability (GPP), a statistical confidence measure, is used for verification of large vocabulary continuous speech recognition (LVCSR) output at subword, word and utterance levels. GPP is obtained by combining exponentially and optimally weighted products of acoustic and language model scores for reappeared units in the reduced search space (e.g., word graph). Experimental results have demonstrated the effectiveness of GPP for verifying LVCSR output at all three levels. Keyword confidence measure, posterior probability, large vocabulary continuous speech recognition 1 The author is now with Microsoft Research Asia.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Generalized Posterior Probability for Verifying Recognized Words Optimally in Microphone Array Applications

In a large vocabulary, continuous speech recognition (LVCSR) system, spoken input is converted into a string of hypothesized, possibly erroneous, words. However, the current state-of-the-art speech recognition technology is still not robust to all variability in speech signals, especially in a hands-free application. To make the signal pick-up from a speech source more immuned to noise or room ...

متن کامل

Context constrained-generalized posterior probability for verifying phone transcriptions

A new statistical confidence measure, Context ConstrainedGeneralized Posterior probability (CC-GPP), is proposed for verifying phone transcriptions in speech databases. Different from generalized posterior probability (GPP), CC-GPP is computed by considering string hypotheses that bear a focused phone with partially matched left and right contexts. Parameters used for CC-GPP include context win...

متن کامل

Background model based posterior probability for measuring confidence

Word posterior probability (WPP) computed over LVCSR word graphs has been used successfully in measuring confidence of speech recognition output. However, for certain applications the word graph is too sparse to warrant reliable WPP estimation. In this paper, we incorporate subword units as background models to generate a subword graph for estimating posterior probability. Experiments on both E...

متن کامل

Confidence measures from local posterior probability estimates

In this paper we introduce a set of related confidence measures for large vocabulary continuous speech recognition (LVCSR) based on local phone posterior probability estimates output by an acceptor HMM acoustic model. In addition to their computational efficiency, these confidence measures are attractive as they may be applied at the state-, phone-, wordor utterance-levels, potentially enabling...

متن کامل

Performance improvement of dialog speech translation by rejecting unreliable utterances

We discuss how to measure the reliability of recognized utterances based on a confidence measure, and applied it to a dialog speech translation system. In this study, we employ generalized word posterior probability (GWPP), a confidence measure for verifying recognized words, and expand it to measure the reliability of recognized utterances. We confirmed the performance improvement by applying ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004